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ABSTRACT 



The invention relates to methods of identifying and charac- 
terizing properties of polymers to provide information about 
the polymer such as the charge of the polymer, the number 
and types or characteristics of units of the polymer and the 
sequence of the polymers. The invention also relates to 
methods of sequencing polymers such as nucleic acids, 
polypeptides and polysaccharides and methods for identify- 
ing a polysaccharide-protein interaction. 

21 Claims, 13 Drawing Sheets 
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DOS, VMS and OS8 are examples, which controls the We claim: 

execution of other computer programs and provides 1. A method for identifying a subpopulati on of polymers 

scheduling, debugging, input/output control, accounting, having a property in common with a sample polymer of 

compilation, storage assignment, data management and chemical units, comprising: 

memory management, and communication control and 5 (A) applying an experimental constraint to the polymer to 

related services. The processor and operating system define modify the polymer, 

a computer platform for which application programs in (B) detecting a property of the modified polymer; 

high-level programming languages are written. ( C) identifying a population of polymers of chemical units 

A memory system typically includes a computer readable having the same molecular length as the sample poly- 

and writeable nonvolatile recording medium, of which a jq mer, and 

magnetic disk, a flash memory and tape are examples. The (D) identifying a subpopulation of the identified popula- 

disk may be removable, known as a floppy disk, or t ion of polymers having the same property as the 

permanent, known as a hard drive. A disk has a number of modified polymer by eliminating, from the identified 

tracks in which signals are stored, typically in binary form, population of polymers, polymers having properties 

i.e., a form interpreted as a sequence of one and zeros. Such 15 that do not correspond to the modified polymer. l3fl0f? \ f\ 

signals may define an application program to be executed by 2. The method of claim 1, further comprising repeating ^ a(Y1 p ^ 0~ 

the microprocessor, or information stored on the disk to be ste ps (A), (B), and (D) on the modified polymer to identify r^V 1 

processed by the application program. Typically, in a second subpopulation within the subpopulation of poly- noVuSACCVlfltV^ W < 

operation, the processor causes data to be read from the mers having a second property in common with the twice » J 

nonvolatile recording medium into an integrated circuit 2 o modified polymer. 

memory element, which is typically a volatile, random 3. The method of claim 2, further comprising repeatedly 

access memory such as a dynamic random access memory performing the steps (A), (B), and (D) on the modified 

(DRAM) or static memory (SRAM). The integrated circuit polymer until the number of polymers within the subpopu- 

memory element allows for faster access to the information i at i 0 n falls below a predetermined threshold, 

by the processor than does the disk. The processor generally 25 4 . Th e met hod of claim 3, wherein the predetermined 

manipulates the data within the integrated circuit memory threshold of polymers within the subpopulation is two 

and then copies the data to the disk after processing is polymers and wherein the method is performed to identify 

completed. A variety of mechanisms are known for manag- th e sequence of the polymer. 

ing data movement between the disk and the integrated 5. The method of claim 3, wherein the experimental 

circuit memory element, and the invention is not limited 30 constraints applied to the polymer are different for each 

thereto. It should also be understood that the invention is not repetition. 

limited to a particular memory system. 5. The method of claim 1, wherein the experimental 

The invention is not limited to a particular computer constraint applied to the polymer is digestion with an 

platform, particular processor, or particular high-level pro- exoenzyme. 

gramming language. Additionally, the computer system may 35 7. The method of claim 1, wherein the experimental 

be a multiprocessor computer system or may include mul- constraint applied to the polymer is digestion with an 

tiple computers connected over a computer network. That endoenzyme. 

each module (e.g. 108, 112) in FIG. 1 may be separate 8. The method of claim 1, wherein the experimental 

modules of a computer program, or may be separate com- constraint applied to the polymer is selected from the group 

puter programs. Such modules may be operable on separate 40 consisting of restriction endonuclease digestion; chemical 

computers. Data (e.g. 102, 110, 114, 116, and 118) may be digestion; chemical modification; interaction with a binding 

stored in a memory system or transmitted between computer compound; chemical peeling; and enzymatic modification, 

systems. The invention is not limited to any particular 9. The method of claim 1, wherein the property of the 

implementation using software or hardware or firmware, or polymer is molecular weight. 

any combination thereof. The various elements of the 45 10. The method of claim 1, wherein the population of 

system, either individually or in combination, may be imple- polymers of chemical units includes every polymer 

mented as a computer program product tangibly embodied sequence having the molecular weight of the sample poly- 

in a machine-readable storage device for execution by a mer. 

computer processor. Various steps of the process may be U. The method of claim 1, wherein the population of 
performed by a computer processor executing a program 50 polymers of chemical units includes less than every polymer 
tangibly embodied on a computer-readable medium to per- sequence having the molecular weight of the sample poly- 
form functions by operating on input and generating output. mer. 

Computer programming languages suitable for implement- 12. The method of claim 1, wherein the step of detection 

ing such a system include procedural programming involves the use of mass spectrometry to determine the 

languages, object-oriented programming languages, and 55 molecular weight of the polymer. 

combinations of the two. 13. The method of claim 12, wherein the mass spectrom- 

The present invention is not to be limited in scope by etry is matrix assisted laser desorption ionization which 

examples provided, since the examples are intended as a detects molecular weight with an accuracy of approximately 

single illustration of one aspect of the invention. Various one Dalton. 

modifications of the invention in addition to those shown 60 14. The method of claim 1, wherein polymer is reduced to 
and described herein will become apparent to those skilled at least two fragments and the property of the polymer is the 
in the art from the foregoing description and fall within the size of the fragments and wherein the step of detection 
scope of the appended claims. The advantages and objects of involves strong ion exchange chromatography, 
the invention arc not necessarily encompassed by each 15. The method of claim 1, wherein the step of identifying 
embodiment of the invention. All references, patents and 65 includes selecting the population of polymers of chemical 
patent publications that are recited in this application are units from a database including molecular weights of poly- 
incorporated in their entirety herein by reference. mere of chemical units. 
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16. The method of claim 15, wherein the database 
includes identifiers corresponding to chemical units of a 
plurality of polymers, each of the identifiers including a field 
storing a value corresponding to a property of the corre- 
sponding chemical unit. 5 

17. A method for sequencing a polymer, comprising: 

(A) applying an experimental constraint to the polymer to 
modify the polymer, 

(B) detecting a property of the modified polymer; 

(C) identifying a population of polymers having the same 
molecular length as the sample polymer and having 
molecular weights similar to the molecular weight of 
the sample polymer; 

(D) identifying a subpopulation of the identified popula- 15 
tion of polymers having the same property as the 
modified polymer by eliminating, from the identified 
population of polymers, polymers having properties 
that do not correspond to the modified polymer; 

(E) repeating steps (A), (B), and (D) by applying addi- 20 
tional experimental constraints to the polymer and 
identifying additional subpopulations of polymers until 
the number of polymers within the subpopulation is one 
and the sequence of the polymer may be identified. 
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18. The method of claim 17, wherein the polymer is a 
polysaccharide. 

19. The method of claim 18, wherein the polysaccharide 
is a heparin-like-glycosaminoglycan. 

20. A method for identifying a subpopulation of polysac- 
charides having a property in common with a sample 
polysaccharide, comprising: 

(A) applying an experimental constraint to the polysac- 
charide to modify the polysaccharide; 

(B) detecting a property of the modified polysaccharide; 

(C) identifying a population of polysaccharides having the 
same molecular length as the sample polysaccharide; 
and 

(D) identifying a subpopulation of the identified popula- 
tion of polysaccharides having the same property as the 
modified polysaccharide by eliminating, from the iden- 
tified population of polysaccharides, polysaccharides 
having properties that do ^correspond to the modified 
polysaccharide. Ccft~ 

21. The method of claim 20, wherein the polysaccharide 
is a heparin-like-glycosaminoglycan. 

* * * * * 



